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Abstract 


-  Let  Xi  be  a  semimartingale  which  is  either  continuous  or  of  counting  process  type  and  which 
satisfies  the  stochastic  differential  equation  dXt  =  Yt  a(t,  Zt)  dt  +  dMt,  where  Y  and  Z  are  pre¬ 
dictable  covariate  processes,  M  is  a  martingale  and  xx  is  an  unknown,  nonrandom  function.  We 

study  inference  for  a  by  introducing  an  estimator  for  A(t,z)  =  f*f*a(s,x)dsdx  and  deriving 
•  •  |  •>  * 

a  functional  central  limit  theorem  for  the -estimator.  The  asymptotic  distribution  turns  out  to 
be  given  by  a  Gaussian  random  field  that  admits  a  representation  as  a  stochastic  integral  with 
respect  to  a  multiparameter  Wiener  process.  This  result  is  used  to  develop  a  test  for  independence 
of  X  from  the  covariate  Z ,  a  test  for  time-homogeneity  of  a,  and  a  goodness-of-fit  test  for  the 
proportional  hazards  model  a(t,z)  =  Ofi(t)  <&(*)  used  in  survival  analysis.  ;  .... _ - 
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1.  Introduction 


Consider  a  nonlinear  semimartingale  regression  model  in  which  a  process  X  is  related  to  a 
covariate  process  Z  by 


Xt=X0+  f 
Jo 

Xt  =  Yt  a(t,  Zt), 


da  +  Mt , 


where  a  is  an  unknown,  bounded,  deterministic  function,  M  is  a  martingale  and  Y  is  an  indicator 
process,  taking  the  value  1  when  A'  and  Z  are  under  observation,  zero  otherwise.  In  the  case  that 
X  is  a  counting  process,  A  and  a  are  called  the  intensity  process  and  conditional  hazard  function 
respectively.  If  the  intensity  process  is  of  the  form  A <  =  a(t)  Zt,  we  have  Aalen’s  (1978)  multiplica¬ 
tive  intensity  model,  for  which  a  well  developed  theory  of  hazard  rate  and  integrated  hazard  rate 
estimation  exists  (see  the  survey  article  of  Andersen  and  Borgan,  1985).  For  the  nonlinear  model 
(1.2)  an  estimator  A(*,z)  of  the  time-integrated  conditional  hazard  function  A(t,  z)  =  J*  a (s,z)ds 
at  a  fixed  level  z  of  the  covariate  Z  has  been  studied  by  Beran  (1981)  and  Dabrowska  (1987)  in  the 
survival  analysis  setting,  and  by  McKeague  and  Utikal  (1987)  in  the  general  case.  This  estimator 
was  used  to  develop  methods  of  inference  for  the  function  a(*,z)  at  fixed  z,  based  on  observation 
of  i.i.d.  replicates  of  (X,Y,Z). 

In  the  present  paper  we  study  inference  for  the  entire  conditional  “hazard”  function  a(-,  •). 
For  that  purpose  we  introduce  the  estimator 


A[t,z)  =  f  A(t,x)dx 
Jo 


of  the  time  and  state  integrated  hazard  function 


A(t,z)  =  I  I  a[a,x)dadx 

Jo  Jo 


When  AT  is  a  continuous  process  or  a  counting  process  Theorem  3.1  gives  the  weak  convergence  of 
the  appropriately  normalized  time  and  state  indexed  process  A  to  a  Gaussian  random  field.  This 
is  proved  by  using  the  results  of  Bickel  and  Wichura  (1971)  to  establish  tightness.  Convergence 
of  the  finite  dimensional  distributions  is  shown  using  Rebolledo’s  (1980)  martingale  central  limit 
theorem. 

In  Section  4.1  we  propose  a  test  for  independence  of  X  from  the  covariate  process  Z.  Here 
independence  from  the  covariate  means  that  a  is  only  a  function  of  time.  A  natural  estimator  for 
A  under  the  hypothesis  of  independence  is  given  by  A(t,  z)  =  z  A(t),  where  A  is  the  Nelson-Aalen 
estimator.  We  derive  ihe  asymptotic  distribution  of  A  -  A  in  Theorem  4.1  and  show  that  a  maximal 
deviation  statistic  based  on  A  —  A  yields  a  consistent  test  for  independence. 

In  Section  4.2  we  propose  a  test  for  time-homogeneity,  i.e.  that  a  =  a(t,z)  does  not  depend  on 
time  t.  An  estimator  for  A  under  the  hypothesis  of  time-homogeneity  is  given  by  A*(t,z)  =  t  A(l,z). 
A  maximal  deviation  test  statistic  based  on  A  —  A*  is  shown  to  yield  a  consistent  test  for  time- 
homogeneity. 

In  Section  4.3  we  develop  a  goodness-of-fit  test  for  the  “proportional  hazards”  model  a(t,z)  = 
ai (0 a2(z)»  where  ari(t)  and  ot2(z)  are  arbitrary  unknown  functions.  This  model  has  been  studied 
by  Thomas  (1983),  Tibshirani  (1984),  Hastie  and  Tibshirani  (1986)  and  O’Sullivan  (1986a,  1986b) 
in  the  survival  analysis  context  (where  it  is  a  generalization  of  Cox’s  (1972)  proportional  hazards 
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model).  These  authors  propose  various  estimators  for  the  log  relative  risk  function  log  or2 ,  where 
c*2  is  assumed  to  be  positive,  but,  except  for  O’Sullivan  (1986a),  who  finds  a  rate  of  convergence 
for  his  estimator,  they  do  not  provide  any  asymptotic  theory.  We  introduce  A(l,*)  as  an  estimator 
of  the  cumulative  relative  risk  function  A2(-)  =  /0  a2(x)dz  and  find  its  asymptotic  distribution. 
Technical  lemmas  used  in  the  proofs  of  our  main  results  are  given  in  Section  5. 

2.  Preliminaries 

Let  (fl,  7,  P)  denote  a  complete  probability  space,  (Tt,t  €  [0,1])  a  nondecreasing,  right- 
continuous  family  of  sub-u-fields  of  7,  where  To  contains  all  P-null  sets  in  7.  All  processes  are 
indexed  by  t  €  [0,1].  The  process  (Aft,  7)  is  assumed  to  be  a  zero-mean  L4-martingale  with 
sample  paths  in  Skorohod  space  D\ 0, 1].  The  quadratic  characteristic  of  M  will  be  denoted  by  ( M ) 
and  its  quadratic  variation  by  [M\.  The  processes  Y  and  Z  are  assumed  to  be  predictable,  with 
Y  an  indicator  process.  For  simplicity,  Z  is  supposed  to  be  scalar  valued.  The  processes  X ,  Y,  Z 
and  M  are  related  by  (1.1)  and  (1.2)  which  can  be  written  in  the  form 


dXt  =  Yt  a(t,  Zt)  dt  +  dMt. 


(2.1) 


We  assume  that 


(M)t=  f \(t,Z„Y.)ds ,  (2.2) 

Jo 

where  7  is  a  bounded,  measurable  function.  Note  that  if  X  is  a  counting  process  we  have 


7(*.*,y)  =  <*(t,z)y. 


(2.3) 


Let  W  =  ( W (t,  z),  (t,  z)  G  [0,  l]2)  be  a  two-parameter  Wiener  process,  i.e.  a  Gaussian  process 
with  zero  mean  and  EW(t,z)  W(t',z')  =  min(t,t')  min^,^).  Let  /*  <t>[s,x)  dW{a,  x)  denote  a 
continuous  version  of  the  Wiener  integral  of  a  function  <f>  €  L2([0,  l]2,  ds  dx)  defined  by  Ito  (1951), 
Wong  and  Zakai  (1974)  and  Bass  (1988).  The  estimators  and  test  statistics  that  we  shall  introduce 
have  asymptotic  distributions  which  can  be  represented  in  terms  of  stochastic  integrals  of  this  type. 

We  make  the  following  assumptions: 

(Al)  For  each  t,  the  random  vector  (Zt,Yt)  is  absolutely  continuous  with  respect  to  the  product 
of  the  Lebesgue  and  counting  measure.  Denote  the  corresponding  density  by  fz{t)Y(t)(z>  !/)• 

(A2)  fz(t)Y(t){z,  1)  is  bounded  away  from  zero. 

(A3)  fz(t)Y{t)(z >1)  is  continuous  as  a  function  of  t  and  z. 

(Bl)  7(t,  z,y)  is  a  continuous  function  of  t  and  z  for  each  fixed  y. 

(B2)  a  is  Lipschitz,  i.e.  there  exists  a  constant  K  such  that 


|a(ta,  zx)  -  a(t2,  *2)  i  <  Ky/fa-  t2 )2  +  (zi  -  *2)2 


for  all  ti,t2,zi,z2. 


Let  C2  =  £■([(),  l]2)  denote  the  space  of  continuous  functions  on  the  unit  square  equipped 
with  the  supremum  norm  [j  •  |[.  Let  denote  the  extension  of  the  space  D  [0, 1]  to  functions  on 
[0,1]2,  as  described  in  Neuhaus  (1971). 


3.  Estimation  of  A 


For  simplicity  we  restrict  the  region  over  which  A  is  to  be  estimated  to  [0,  l]3.  For  each 
n  >  1,  let  ,  ij")  be  the  partition  of  the  interval  [0, 1]  defined  by 

^  =  llT’  r=1>  -  >dn~1 


jl")  -  [l  _  J_  1 
ld'  l1  d’1]’ 


where  d„  is  an  increasing  sequence  of  positive  integers.  The  superscript  n  will  usually  be  sup¬ 
pressed  in  the  notation,  for  example  we  shall  write  Ii,...,IdK  instead  of  I^n\...,J^nK  Let 

( Xi ,  Yi,  Zi,  Mi),  i  =  1 . n  denote  copies  of  the  generic  processes  defined  above,  where  only  M, 

is  not  observable  and  the  corresponding  filiations  are  independent.  Define 


(3.1) 

(3.2) 


xin)(t)  =  £  f*  /{*(#)  e  Ir)Yi(s)  dXi(s), 
i=  1  Jo 

Yr^(s)  =  ±I{Ms)eJr}Yi(s), 

i=l 

=  [  ~<!j~:dX{rn)(8)’  for  zeJr> 

Jo  Yr  (») 

where  1/0  =  0.  Since  A(t,z)  =  /0*  A(t,x)  dx,  we  propose  to  estimate  A  by 

A(t,z)  =  f  A{t,x)dx. 

Jo 

The  asymptotic  distribution  of  A  is  given  by  the  following  result. 

THEOREM  3.1.  Suppose  that  A1-A3,  Bl,  B2  hold,  dj^/n  — ♦  oo,  dn  =  o(n6)  for  some  8  6  (1/2,  1) 
and  A  is  a  counting  process  or  has  continuous  sample  paths.  Then 

\fn{A  -  A)^*m 

in  D2  as  n  — ►  oo,  where 

[  f  y/h(a,x)dW(a,x), 

Jo  Jo 

h(B  rr\  -  ,y(8»a:»1) 

fz(,)Y(,)(  1,1) 

REMARK.  The  process  m  is  a  continuous  Gaussian  random  field  with  mean  zero  and  covariance 
function 

rtiAtj 

Cov(m{ti,zi),  m(t2,Z2))  =  /  h{a,x)dsdx. 

Jo  Jo 

Proof  of  Theorem  8.1.  Define 

dMr<">(o)  -  £  I{Zi(a)  €  Jr}y;(s)  dM,(«),  (3.4) 

t=l 

R 

c*<n)(«)  =  €  J,}y<w«(*,«w). 


(3.5) 


Then  by  (2.1),  (3.1) 


Also  define  the  processes 


dX^{s)  =  a<")(s)  da  +  dM <">  (s). 


J'  J‘  ^±-iMl’\.)Hze  l,)dx,  (3.7) 

4(i.  «j=E  J'f  fSn  * /(* e  J')  *•  (3-8) 

r=1^o  yr'  '(«) 

M(n)  (t,  z)  =  ^  [*  1  dM W  (•).  (3.9) 

V  '  dn  Jo  Yr{n){a)  W  ' 

Here  and  in  the  sequel,  any  summation  over  r  =  1, . . . ,  [ zdn\  is  defined  to  be  zero  when  [zdn\  =  0. 

A  A  A  p 

Now  y/n{A  —  A)  =y/n(Ap  —  A)+M.  Lemma  1  gives  y/n\\Ap  -  ^|j— *0.  To  complete  the  proof  we 
need  to  show  that  Ml")-^>m  in  Z>2. 

Suppose  that  M'n'—*m  in  D 2.  Define  a  linear  map  ?rn:.D2  — ►  D2  by  irn(f)(t,z)  = 
f(t,  zr-i)+dn{z— zr-i)/(t,zr)  for  z  G  Jr,  where  zr  =  r/dn.  Here  7r„(/)(t,  ■)  is  a  piecewise  linear  ap¬ 
proximation  to  based  on  the  points  zT,  r  =  1, .. .,  dn,  for  each  t.  Note  that  Ml”)  =  v „(Ml")). 

Also,  appealing  to  a  D2  version  of  Lemma  4.1  of  McKeague  (1988),  we  have  xn(Ml"))-^m  in  D2, 
where  we  have  used  the  fact  that  m  has  its  sample  paths  in  C2.Thus  Ml")-^m  in  D2.  All  that 
remains  to  be  proved  is  that  Ml")  — *  m  weakly  in  Z>2.  This  will  be  established  by  showing  that 
{Ml"),  n  >  1}  is  tight  in  D2  and  the  finite  dimensional  distributions  of  Ml")  converge  weakly  to 
those  of  m.  _ 

^  Denote  the  increment  of  Ml")  over  the  rectangle  (s,  t]  x  (x,  y]  by  Ml")  ((«,*]  x  (x,  y])  = 

Ml")(t,  y)  —  Ml")(s,y)  —  Ml")(t,  x)  +  M!")(«,z).  Tightness  is  established  by  checking  some  prod¬ 
uct  moment  conditions  of  Bickel  and  Wichura  (1971)  for  the  increments  of  Ml")  over  certain 
neighbouring  rectangles: 

E ( Ml”) ( (s, t]  x  (x,y]))2(Ml")((s,i]  x  (y,z)))2  <  K  {t-a)*{y-x)  (z  -  y) 

and 

E  (  M<">(  (s,  t)  X  (x,  y] )  )2(  M<">(  (t,  u]  x  (*,  yj )  )2  <  K  (t  -  s)*  (u  -  t)  (y  -  xf . 

This  is  done  in  Lemmas  2  and  3. 

To  show  convergence  of  all  finite  dimensional  distributions  it  suffices  to  show  that  for  any 
0  <  *o  <  •  ••  <  zP  <  1)  P  >  1 

(M(-,zy)  -  MO.zy-O^j-^mf-.zy)  -  m(-,ry_i))J=1 

in  D[0,  l]p,  where  £)[0,  l]p  is  the  product  of  p  copies  of  Z?[0, 1].  This  can  be  done  using  a  p- 
variate  version  of  Rebolledo’s  (1980)  martingale  central  limit  theorem,  as  given  by  Aalen  (1977) 
and  Andersen  and  Gill  (1982,  Theorem  1.2)  in  the  counting  process  case.  The  processes  M(-,zy)  — 
M(*,zy_i),  j  =  1,.. .  ,p  are  orthogonal  square  integrable  martingales  and  by  Lemma  4 

(M(-,Zj)  -  -  m(>,zJ_1))t, 
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for  each  i,  j  =  1, . . .  ,p.  That  completes  the  proof  for  the  continuous  case.  In  the  counting  process 
case  we  also  need  to  check  the  Lindeberg  condition  (cf.  Andersen  and  Gill’s  (1.4)  with  /  =  r,  *  =  j, 
n  =  dn) 

d\  r1  -  >  _  i  .  r. 

(3.10) 


for  all  e  >  0,  where 


El  ^W^d^WI  >  e}d(MW).Z0, 

H("]{s)  =  (  y,(nV)  if  <  r  - 

,T  ( 0  otherwise, 


This  is  done  in  Lemma  6. 


Confidence  sets  for  A 

In  order  to  apply  Theorem  3.1  to  obtain  Kolmogorov-Smirnov  type  confidence  sets  for 
A  of  the  form  {A:  -^/nsupj  x  [^(t,z)  -  >l(t,z)|  <  c)  we  would  need  to  determine  the  quantiles  of 
t  =  sup*  x  |m(t,z)|.  In  the  time-homogeneous  case,  considered  below,  it  is  possible  to  use  existing 
tables.  In  the  general  case,  the  representation  of  m  in  terms  of  the  Brownian  sheet  process  W  gives 
a  way  to  obtain  such  quantiles  by  simulation.  We  shall  only  consider  this  in  the  counting  process 
case,  but  the  continuous  case  is  similar.  First  estimate  the  function  H(t,  z)=  f0  /0*  h(s,  x)  dx  da  by 

!*<*»!  ,< 


H{t,z)  =  ~  E  f  —A — dxl [nH*) 

V  <%  fr[  Jo  (Yr{n)(s))2 


and  then  estimate  h  by 


where  K  is  a  bounded,  nonnegative  kernel  function  with  compact  support,  integral  1  and  bn  is  a 
bandwidth  parameter,  bn  — ►  0.  The  following  result,  which  is  proved  in  Lemma  10(a),  shows  that 
h  is  an  L2-consistent  estimator  of  h. 

PROPOSITION  3.2.  Suppose  that  X  is  a  counting  process,  the  assumptions  of  Theorem  3.1  hold, 
d„62  — ►  oo  and  K  is  Lipschitz.  Then  E f*  f*  |h(t,z)  -  h(t,z)\2dtdz  — *  0. 

The  process  m,  with  h  in  place  of  h,  could  then  be  simulated  to  obtain  approximate 
quantiles  for  r.  Using  Proposition  3.2  it  can  be  shown  (cf.  the  proof  of  Proposition  4.2)  that  this 
procedure  leads  to  asymptotically  correct  confidence  sets  for  A. 


Confidence  bands  for  the  integrated  hazard  of  a  time-homogeneous  counting  process 

Let  N  be  a  counting  process  which  is  time-homogeneous  in  the  sense  that  its  conditional 
hazard  function  only  depends  on  the  covariate  process  Z,  so  N  has  intensity 


A  «  =  y,a(Z(). 


(3.11) 


An  estimator  of  A (z)  =  JQ  a(x)  dx  from  i.i.d.  copies  (Nit  Yit  Zi),i  =  1,.. .  ,n  of  ( N,Y,Z )  is  given 
by 


where 


A(z)=  [X  A(x)dx, 

Jo 

for  x  €  Ir, 

JO  Yr  [8) 


(3.12) 


and  is  defined  by  (3.1)  with  X  replaced  by  N.  To  apply  our  result  to  this  special  case  we 
note  that  the  projection  3r:D2  — ►  £>[0,  l]  defined  by  jt (/)  (z)  =  /(l,  z)  is  continuous,  so  by  the 
continuous  mapping  theorem  (Billingsley,  1968,  Theorem  5.1)  we  obtain  the  following  consequence 
of  Theorem  3.1.  A  similar  result  could  be  obtained  in  the  case  that  X  has  continuous  sample 
paths. 

PROPOSITION  3.3.  Suppose  that  A1-A3,  Bl,  B2  are  satisfied,  d^/n  — ►  oo  and  dn  =  o(n*)  for 
some  8  €  (1/2,  1).  Then,  for  A  defined  by  (3.12), 

y/n(A  -  A)^*m 

in  C[0, 1]  as  n  — *■  oo,  where  m  =  (m(z),  z  €  [0, 1] )  is  a  continuous  Gaussian  martingale  with  mean 
zero  and  covariance  function  Cov(m(zi),m(z2) )  =  H(zi  A  z2),  where 


*(»)-  /'  [' ,  a(x\ 

Jo  Jo 


With  the  help  of  Proposition  3.3  we  now  construct  confidence  bands  for  A.  Denote 

-  1*^1  rl 


-  r1  i 

H{z)  =  -jj  I  — F^r - dN^(s). 

'Zfr'Jo  (n( ^(s))2 


As  a  consequence  of  the  proposition, 

^ _ y/Hi1)  _  Ai.))£,w of  H(’)  \ 

in  C[0, 1]  as  n  — ♦  oo,  where  W°  is  a  standard  Brownian  bridge.  Now  H  is  a  uniformly  consistent 
estimator  of  H  by  Lemma  9.  Thus  we  obtain  the  following  asymptotic  100  (1  —  a)%  confidence 
band  for  A: 


WW+18)-  "M- 


where  P(sup0<t<1y2  |W°(t)|  >  ca)  =  a,  0  <  a  <  1.  A  table  for  ca  can  be  found  in  Hall  and 
Wellner  (1980j! 


4.  Goodness-of-fit  tests 


4-1.  Testing  for  independence  from  the  covariate  process 

In  this  section  we  consider  the  problem  of  testing  whether  the  covariate  process  Z  is 
absent  from  the  model,  i.e.  whether  a  is  only  a  function  of  time.  Let  Ho  denote  the  null  hypothesis 
Ho‘Ot{t,zi)  =  a(t,Z2)  for  all  t,zi,Z2  €  [0,1].  Under  Ho  the  natural  estimator  of  A  is 


A(t,z)  =  z  A(t), 


where  A  is  the  Nelson- Aalen  estimator 


A(t)  =  f 
Jo 


*  dX^(s) 


*(n)(0  =  J2  f  I{Zi{s)  e  [0,1])  Yi($)  dXi(s), 
,= l  Jo 

y(n)(s)  =  ^/(^(s)e  [0,1]) y,(s). 


Define  some  functions  g  and  p  by 


$(*»*)  =  7(M>1)  /z(«)y(«)(*,l)/*>2(<)> 

p(t)  =  P(0  <  Z{t)  <  1,  Y{t)  =  1)  =  f  fz(t)  Y(t)  (*,  1)  dx. 

Jo 

The  following  result  gives  the  asymptotic  distribution  of  A  —  A. 
THEOREM  4.1.  Under  the  conditions  of  Theorem  3.1,  if  Hq  holds  then 


y/n(A  -  A)-^*mo 


in  D2  as  n  — *  00,  where 


m0(t,z)=  f  f  \/h(s,x)  dW(s ,  x)  -  z  [  f  s/g{s,x)  dW{s,x). 

Jo  Jo  Jo  Jo 

The  Kolmogorov-Smirnov  type  test  statistic  =  y/n  supt  x  \A(t,z)  -  A(t,z)\  could  be 
used  for  testing  Ho-  Note  that  the  continuous  mapping  theorem  and  Theorem  4.1  imply  that 

|mo(t,z)|  as  n  — »  00.  In  order  to  construct  an  asymptotic  size  a  test  of  Ho,  rejecting 
H0  if  is  large,  we  first  need  to  introduce  appropriate  estimators  for  the  functions  g  and  h 
under  Ho-  Again  we  shall  only  do  this  in  the  counting  process  case.  Let 

Iff: I  r*  v(n)f0\ 


n  W  dX^js) 


ft 


(n)(s)y  (")(«) 


m 


wV.va’.vw; 


and  define 


°kL  L  k 

where  /T  is  a  bounded,  nonnegative  kernel  function  with  compact  support,  integral  1  and  bn  is  a 
bandwidth  parameter,  b„  — ♦  0. 

The  distribution  of  T  =  supt  x  |m0(t, z)\  depends  only  on  6  =  ( g,h )  and  is  continuous, 
see  Ylvisaker(1968).  Let  ca(0)  denote  the  upper  a-quantile  of  T,  so  that  Pg{T  >  ca(0)}  =  a  for 
0  <  a  <  1.  Given  the  estimate  6n  =  ( g,h ),  we  may  simulate  the  process  mo,  with  g  and  h.  in  place 
of  g  and  h  respectively,  to  obtain  an  approximate  critical  level  =  ca(6n).  In  Proposition  4.2 
we  show  that 

lim  P(T<n)  >  )  =  a. 

n — *•  oo 

Thus,  rejecting  Ho  when  yields  an  asymptotic  size  a  test  for  independence.  In 

Proposition  4.3  we  show  that  this  test  is  consistent  against  all  alternatives. 

Proof  of  Theorem  f.l. 

Decomposing  /  in  a  similar  way  to  A  in  the  proof  of  Theorem  3.1,  we  can  write 


y/n(A  -  A){t,z)  =  M[t,z)  -  zM(t)  +  \/n(Ap  -  Ap)(t,z), 


where 


and  under  Hq 


Af(n)(0  =  v^E 

w  t'Jo  Y(»)(s)  ’ 

Ap(t,z)  =  z  f  a{s)I{Y^{s)>0)ds. 

Jo 


Putting  dn  =  1,  k  =  1  in  Lemma  4  of  McKeague  and  Utikal  (1987),  we  obtain  y/n\\Ap  -  ^||^0 
under  Ho.  Also,  by  Lemma  1,  %/n||^p-  ^||^+0.  Therefore  ~  /p||^ *0  under  H0.  To  complete 

the  proof  it  suffices  to  show  that  £-^*m0,  where  £(t,z)  =  M(t,z)  -  z  M[t).  Set 


»(*)=/  f  Vg{s,x)  dW(s,x), 
Jo  Jo 


where  W  is  the  same  Brownian  sheet  used  to  define  m  in  Theorem  3.1.  Then  m  is  a  zero  mean 
continuous  Gaussian  martingale  with  predictable  variation  process 


(m)t  =  /  /  g(s,  z)  dx  ds. 
Jo  Jo 


Suppose  that  (M,  M)— *(m,  m)  jointly  in  D2X  D[0, 1].  Define  a  map  n’n:  £>2  x  D[  0, 1]  by  n'n(fi,f2)  = 
(*n(/i)»  {2),  where  is  defined  in  the  proof  of  Theorem  3.1.  Then,  as  in  that  proof,  (M,M)  = 
ir 'n(M,  jointly  in  Z?2  x  D[0, 1]  and,  since  m  and  m  have  continuous  paths,  by  the 


continuous  mapping  theorem  we  may  conclude  that  £  converges  weakly  to  the  process  m(t,z)  - 
zm(t)=  m0(t,z). 

V 


It  remains  to  show  that  ( M,M)—*[m,m )  jointly  in  D2  x  _D[0,1].  The  process  M  is  a 
martingale  and  { ,  by  Lemma  9(a).  The  Lindeberg  condition  (3.10),  with  p  =  1  and 
=  V*/Y{n)(  s)  can  be  checked  as  in  the  proof  of  Lemma  6.  Therefore,  by  Rebolledo’s 
martingale  central  limit  theorem,  M-^m  in  D[0, 1).  Also,  by  the  proof  of  Theorem  3.1,  we  have 
M—*m  in  D 2.  If  we  can  show  that  the  finite  dimensional  distributions  of  (M,  M)  converge  to 

those  of  (m,m),  then  (M,  M)-^(m,m)  jointly  in  D2  x  Z)[0, 1]  and,  since  m  and  m  have  continuous 
paths,  by  the  continuous  mapping  theorem  we  may  conclude  that  £  converges  weakly  to  the  process 
m(t,z)  -  zm(t)=  mo(t,z). 

To  show  that  the  finite  dimensional  distributions  of  ( M,M )  converge  to  those  of  (m,m), 
it  suffices  to  show  that  for  any  0  <  zq  <  z\  <  •  •  •  <  zp  <  1,  p  >  1, 


((M(.,zy)  -  M(.,zJ_a))Jp=1,M(.))-^((m(.,zJ)  -  m^.O^.mO)) 


in  D[0,  l]p+1.  This  is  done  using  Rebolledo’s  martingale  central  limit  theorem,  as  in  the  proof  of 
Theorem  3.1.  It  only  remains  to  consider  the  covariation  between  M(-,z)  and  M(-).  By  Lemma 
9(b) 


(M(-,  z),  M(-))t-»(m(-,  z),  m(-))t, 


for  each  z.  There  are  p  +  1  Lindeberg  conditions  to  check.  But  these  conditions  have  already  been 
checked  separately  for  the  p  components  involving  M  and  the  one  component  involving  M.  This 
completes  the  proof. 


PROPOSITION  4.2.  Suppose  that  A"  is  a  counting  process,  the  assumptions  of  Theorem  3.1  hold, 
dnb2  — >  00  and  K  is  Lipschitz.  Then  if  H0  holds,  for  all  0  <  a  <  1 


lim  P(T<" >  >  cj,">)  =  a. 


Proof.  Let  0  denote  the  space  of  all  functions  of  the  form  8  =  ( g,h )  with  g  and  h  nonnegative 
bounded  functions  on  [0,  l]2,  and  endow  it  with  the  product  metric  from  L2{  [0,  l]2,  dsdx)  x 
L2(  [0,  l]2,  dsdx).  Let  8n  =  (gn,  hn),  n  >  1  be  a  sequence  in  0  such  that  8n  —*  0.  Then  yfgj,  — ►  y/g 
and  y/h^  —*  y/h  in  L2([0,  l]2,  dsdx).  An  argument  using  Doob’s  inequality  applied  twice  (cf. 
Cairoli  (1970)  and  Bass  (1988))  shows  that  if  ^  £  L2([0,  l]2,  dsdx),  then 


Esup\  f  f  <^(s,i)diy(s,i)  <  16  [  f  <f>2(s,x)  ds  dx. 

t,z  'Jo  Jo  Jo  Jo 


Applying  this  inequality  to  <p  =  y/9^  ~  y/H  and  <f>  =  y/h Z  ~  Vh  gives  Fg^Fg,  where  Fg  i 


is 


the  distribution  function  of  T  under  Pg.  Let  Fg  1  denote  the  left-continuous  inverse  of  Fg.  By 


Billingsley  (1986,  p.343)  we  get  ea(0„)  =  Fg/(  1  -  a)  — ►  Fe  *(1  -  a)  =  ca(0),  provided  Fe  1  is 
continuous  at  1  -  a.  Now  by  Lemma  10(b)  we  have  in  the  metric  of  0.  Thus,  using  a 


subsequence  argument,  ca(8n)-+ca(0)  and 

Pe(T(n)  >  ca(0n))  Pe{T  >  ca{8))  =  a, 
9 


(4.1) 


m 


v 

• r 1 


m 


s* 


m 


a 
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$ 


>3 


3 


V“4 


Vi 
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for  all  but  countably  many  a,  where  we  have  used  Slutsky’s  theorem  and  the  continuity  of  Fg.  Since 
Pg(T ("1  >  ca(0n))  is  a  nondecreasing  function  of  a  it  follows  that  (4.1)  holds  for  all  0  <  a  <  1, 
completing  the  proof. 


PROPOSITION  4.3.  Under  the  assumptions  of  Theorem  3.1,  if  Ho  does  not  hold  then  T^—*oo 
as  n  — »  oo. 

Proof.  First  note  that  if  Ho  does  not  hold  then  \\A  —  ^o[|  >  0,  where 

A0(t,z)  =  z[  -pr  /  a(s,  x)  fz{»)  y(») [x,  1)  dx  ds. 

Jo  P\8)  Jo 

Also  note  that  by  Doob’s  inequality  and  Lemma  7  we  have  F?||M^n^||2  =  O(l),  where  M is 
defined  in  the  proof  of  Theorem  4.1,  and  using  similar  arguments  to  the  proof  of  Lemma  7 

E\\AP  -  .doll2  <  8up£  ~~ph)J0  “(5>a:)/^(«)i,(«)(:c>1)d:c  =°(")> 


where 


/p(t,  z)  =  z  jf  da  and  a(n)  (s)  =  ^  a<n>  (e). 

Thus,  since  ~  i||  =  Op(l)  by  Theorem  3.1, 

V^\\A  -  >lo||  <  sfa\\A  -  i||  +  T(n)  +  ||M^||  +  V^|| AP  ~  Ao\\  =  T(n)  +  Op(1). 
This  shows  that  T^-^* oo  if  Ho  does  not  hold. 


REMARK.  The  above  test  for  independence  can  be  modified  to  provide  a  goodness-of-fit  test  for 
Aalen’s  multiplicative  intensity  model.  Now  Ho  is  the  null  hypothesis  H0:  there  exists  a  function 
ao:  [0, 1]  — +  R  such  that  a(t,z)  =  Oo(t)  z  for  all  t,  z  £  [0,  l).  Under  this  Hq,  the  natural  estimator 
of  A  is  A(t,z)  =  5 z2A(t ),  where  A  is  the  Nelson-Aalen  estimator  as  before,  except  that 


y<">(s)  =  £/(Z,(s)  €  [O.lDK^sjZ^s). 
»=1 


The  only  changes  to  Theorem  4.1  are  that 


p{t)=  [  x)Z(t)Y(t){x,\)dx 
Jo 


4.2.  Testing  for  time-homogeneity 


We  want  to  derive  a  test  for  the  hypothesis  Ho'Ot{t\,z)  =  a(*2>*)  for  all  tj,  *2>  z  £  [0,1], 
i.e.  a  is  only  a  function  of  the  covariate.  One  possible  application  of  such  a  test  would  be  in  testing 
whether  a  pure  jump  process  on  a  finite  state  space  is  a  Markov  renewal  process,  see  McKeague 
and  Utikal  (1987,  Section  1).  The  natural  estimator  for  A  under  Ho  is  Am(t,z)  =  tA(l,z).  In  order 
to  test  Ho  we  could  use  the  test  statistic  S ^  =  y^isupt  t  lA(t,z)  —  A*(t,z)\.  As  in  Section  4.1, 
once  we  know  the  asymptotic  distribution  of  y/n(A  —  A*)  we  can  derive  an  asymptotic  size  a  test 
for  Ho  based  on  5^"i.  This  test  can  be  shown  to  be  consistent  using  a  proof  similar  to  that  of 
Proposition  4.3.  The  asymptotic  distribution  of  y/n(A  -  A *)  is  given  by  the  following  theorem. 

THEOREM  4.4.  Under  the  conditions  of  Theorem  3.1,  if  Hq  holds  then 


A*)2*mi 


in  Z>2  as  n  — ►  oo,  where 


j  f  y/ h(8, x)  dW (s, x)  -  t  f  f  y/h(s,x)  dW(s,x). 

Jo  Jo  Jo  Jo 


Proof.  Note  that  y/n(A  -  Am)  =  ir (y/n[A  -  A) ),  where  *:  D2  — »  D2  defined  by  jt z)  ~f(t,  z)  - 
t  /(l,z)  is  continuous.  The  result  follows  immediately,  using  Theorem  3.1  and  the  continuous 
mapping  theorem. 


4-8.  Testing  for  proportionality 

Thomas  (1983)  introduced  the  model  a(t,z)  =  ai(t)“2(z)  f°r  the  conditional  hazard 
function  in  the  survival  analysis  context,  where  ay:  [0,1]  — ►  R,  j  =  1,2  are  unknown  functions. 
This  model  is  a  generalization  of  Cox’s  proportional  hazards  model  to  allow  for  arbitrary  co- 
variate  dependence  while  keeping  the  proportional  hazards  form.  In  this  section  we  introduce  a 
goodness-of-fit  test  for  Thomas’  model.  Note  that  this  is  not  the  same  as  a  goodness-of-fit  test  for 
Cox’s  proportional  hazards  model.  However,  Cox’s  model  can  be  treated  in  a  similar  fashion,  see 
McKeague  and  Utikal  (1988). 

Let  H0  denote  the  null  hypothesis  Ho :  there  exist  functions  ay:  [0,1]  — ►  R,  j  =  1,2  such 
that  a(t,z)  =  ai(t)a2(z)  for  all  t,z  €  [0,1].  In  order  that  ai  and  02  are  identifiable  we  impose 
the  condition  Aj(l)  =  1  under  Hq,  where  Aj(t)  =  /0*  ai(s)ds.  Equivalently,  we  could  impose  the 
condition  A2(l)  =  1,  where  A2(z)  =  /0*  <*2(1)  dx.  A  reasonable  estimator  for  A  under  H0  is 

A^[t,z)  =  Aj(t)  A2(z), 

where 

Aj  (t)  =  —  (with  1/0  =  0)  and  -^(z)  =  A(l,z). 

*(1,1) 

In  order  to  test  Ho  we  could  use  the  test  statistic  U ^  =  ■y/nsup<iZ  \  A(t,z)  -  >(^(t,z)  |.  As  before, 

once  we  know  the  asymptotic  distribution  of  y/n(A  -  A t)  we  can  derive  an  asymptotic  size  a  test 
for  H0  based  on  U^nh  This  test  is  an  omnibus  goodness-of-fit  test  for  proportionality  in  that  it  is 
consistent  against  any  alternative. 


THEOREM  4.5.  Under  the  conditions  of  Theorem  3.1,  if  Ho  holds  and  A2(l)  ^  0,  then 

y/n(A  -  A^)-^*m2 

in  D2  as  n  — »  oo,  where 

m3(t,z)=  f  f  s/h(s,x)dW(8tx)  -  0  A2(z)  [  [  y/h(a,  i)  dW(s,x) 

Jo  Jo  Jo  Jo 

-PM  (t)  [  [  \/h(s,x)  dW(s,  x)  +  P  Aj(t)  A2(z)  f  [  y/h(s,x)  dW{s,  x) 

Jo  Jo  Jo  Jo 

and  p  =  1/A2(l). 

Proof.  The  result  follows  readily  from  Theorem  3.1,  using  the  continuous  mapping  theorem  (cf. 
the  proof  of  Theorem  4.4)  and  the  identities 

(A  -  *t)(M)  =  (A  -  A)(t,z)  -  -  A(t,l)A(l,z)} 

=  O'  -  »(*,«)  -  wfjjKX'.V  -  >(i,*))  +  >(i,  *)(>((,  i)  -  /(f,i)) 

-M(«,1)(>(M)  -  >1(1.*)]  +  1.1)  -  >U,1)) 


+ 


^(1,1) -A(l,l)rA(t, !)»(!,*)  >t(M)A(l,z)i 


*(M) 


>1(1,1) 


*(1.1) 


REMARK.  Under  H0,  we  have  from  Theorem  3.1  that  A\  and  A2  are  uniformly  consistent  esti¬ 
mators  of  A\  and  A2,  respectively,  and 

y/n{A2  -  A2)-^m3  (4.9) 

in  £)[0, 1]  as  n  — ♦  oo,  where  m3  is  a  continuous  Gaussian  martingale  with  covariance  function 


rz\/\X2  ri 

Co  v(m3(z1),m3(z2))  =  /  /  h(s,x)dsdx. 

Jo  Jo 


This  could  be  used  to  obtain  confidence  bands  for  A2  under  Thomas’  model,  by  transforming  m3 
to  Brownian  bridge  (cf.  the  discussion  following  Proposition  3.2).  An  analogous  result  can  be 
obtained  for  A\. 
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5.  Technical  Lemma? 


P 


In  this  section  we  make  frequent  use  of  lemmas  from  McKeague  and  Utikal  (1987).  We  apply 
those  lemmas  by  changing  wn  to  d"1,  by  taking  Iz  =  Ir  when  z  £  Ir,  by  changing  Y^(s,z) 
to  Yr"^(s),  to  and  J^(s,z)  to  J$n\s),  where  Jrn\s)  =  7{Yrn^(s)  ^  0}.  Since 

ui <r<d*Ir  =  [0,1]  for  all  n,  the  set  C  in  A1-A3,  Bl,  B2  of  McKeague  and  Utikal  (1987)  can  be 
chosen  as  [0,1].  Results  quoted  from  McKeague  and  Utikal  (1987)  will  be  referred  to  as  Lemma 
A.l,  etc.. 

LEMMA  1.  Suppose  that  Al,  A2  and  B2  hold  and 

dn  =  o(nf)  for  some  S  e  (1/2,  1).  (5.1) 


Then 


E\\AP  -A  ||  =  0(0- 


Proof.  From  (3.8)  we  have 

ewk  -  a  ii = e  sup  £  r  f  -^4 da  e  ^ dx  -  r  r  «(•« z) ds  dx 

t.*  '~iJ0  Jo  Yry  ’(s)  Jo  Jo 


cJ-nHs)  I 

<  supy  E  '  -  a(s,i)  <  h  + 12, 

t,r,xelr  Yr  J(s)  ' 


where 


h  =  sup  E  ~  J<n)(s)a(s,z)  , 

»,r,xeJ,  Yr{  '(s) 

I2  =  sup  |a(s,z)|£7(l  -  Jr<n)(s)). 

r,t,x 

By  Lemma  A.5  we  have  Jj  =  0(d~1).  Next,  by  Lemma  A .4  we  have  I2  =  0(exp{-n  if  d"1}), 
where  K  =  inft>*  fz(t)Y(t){z,  !)•  Thus,  since  exp{-nifd“1}  =  0(d„/n)k  for  all  nonnegative 
integers  k  and  (d„/n)*  =  0(l/dn)  for  k  sufficiently  large  by  (5.1),  we  have  I2  =  Ofd'1).  This 
completes  the  proof. 


Proof  of  tightness 


First  note  that 


d<Mjn>),  =  £/{Z,(s)  £  Ir}Yi{s)d(Mi). 


(5.2) 


»=i 


and,  since  Mrn\  r  =  1, . . .  ,dn  are  orthogonal  martingales, 


!*<*„ 


G  fri  (yrw(.))* 
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(5.3) 
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Tightness  of  {M^ ,  n  >  1}  in  Z?2  will  be  shown  by  establishing  a  product  moment  condition 
on  the  increments  of  M^  over  the  grid  T*")  =  [0, 1]  X  {0,  1  /d„,  2/dn, . . . ,  1}. 


For  fixed  0<s<l,  0<x<y<l,  define  the  martingale 

Mi(t)  =  M<n)((s,t]  x  (x,y)),  t  >  s, 

and  denote  mj  =  M\  —  (Mi). 


LEMMA  2.  Suppose  that  Al,  A2  hold,  dn  =  o(n)  and  X  is  either  a  continuous  process  or  a  counting 
process.  Then  there  exists  a  positive  constant  K  such  that  for  all  n  >  1,  (s,x),  (t,y)  6  T<n) 


E(Mi >2  <K(t-  s)2(y  -  x)2, 
E  m2(t)  <  K  (t  —  s)(y  —  x)2. 


(5.4) 

(5.5) 


Proof.  By  (5.3) 


2  rt  ,  rt  i 

J-  (vA’,W)a  '•  (n'r’W)2  ’ 


By  (2.2)  there  exists  a  positive  constant  Jjfj  such  that  d(M)t/da  <  K\  for  all  0  <  s  <  1  .  Therefore, 
by  (5.2),  d(Mr(n))./ds  <  Ai  Yr  (s).  Applying  Fubini’s  theorem  we  obtain 


£  [*  - d(M^)v  f  —r^ - d(M<n>) 

j •  (Yr{?](v)  )2  ri  y.  (Yr,  (v)  )2  ’ 


(*V/»)2 


<-«r.r. 


rtr'WYff’M 


(*»)/ 


dv\dv2. 


But  by  Lemma  A.3 


sup£[  1  1 *  =  o(±)\ 

...  iyr("i(,)l  V  n  > 


This  proves  (5.4).  Now  we  turn  to  the  proof  of  (5.5).  First  we  need  to  obtain  an  explicit  expression 
for  mi .  Integration  by  parts  gives 


M2(t)  =  2  J*  Mi(v—)  dMi(v)  +  [Mi]t. 

In  the  case  that  X  has  continuous  sample  paths  [Afi]  =  (Mi).  In  the  counting  process  case 

M  J,=  £ 


where  A Mi(v)  =  Mi(v)  -  Mi(v-)  is  the  jump  in  Mi  at  time  v,  so  from  (3.6),  (3.9) 


|y  n|  r ‘  i 

n  r=|adn]+l’/* 


) 


:|adn]  + 

=  (Mi)t  +  TJt, 
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mi(t)  =  2  J  Mi(v~)  dMi(v)  +  rjt , 


where  in  the  continuous  sample  path  case  T)t  is  zero.  From  this  expression  we  get 


Em\[t)  <  8  eJ*  M?(v~)  d{M1)v  +  2  E(r})t. 


In  order  to  obtain  an  upper  bound  on  the  first  term  on  the  r.h.s.  of  (5.6)  we  shall  use  the  Burkholder* 
Davis-Gundy  inequality  (see  Dellacherie  and  Meyer,  1982,  p.287) 

E  sup  M}(v)  <  K  E[M i]2,  (5.7) 

where  here,  and  in  what  follows,  If  is  a  generic  positive  constant  which  is  independent  of  n.  Then, 
by  orthogonality  of  the  martingales  Mr  ,  r  =  1, . . . ,  dn> 

if  .up 

<  tk  (»-*)(»  -  »)(£[w.]f)*(«»p£  (i/yr'">w  )2)> 

an  r,v 

(by  (5.7)  and  the  Cauchy  -  Schwarz  inequality) 

<  K  (y  -  x)(t  -  s)(E  [Mi]2)*,  (5.8) 

by  Lemma  A.3.  Now  [Mi]  =  (Mi)  +  r/,  so 


E  [M,]2  <  2  2?(Mi)t3  +  2  E(rj)t  <  K  (t  -  s)2(*  -  y)2  +  2  E(r,)t 


by  (5.4).  Also 


Iv^l  rt  , 

E <”>‘  =  p  £  *  J  — ^y—— rf(A^r(n))« 

r=|*<iR]  +  l  J '  (^r  »)4 

<^dn(y~  x)(t  -s)K  sup  E 

an  r,v  xyr'  '(y)' 

<  K—  ^  {t  —  s)  (by  Lemma  A.3) 

n  dn 

<  K  (y  -  x )2(t  -  s) 


(5.10) 


since  dn  =  o(n)  and  y  -  x  >  1/d,,  if  z  ^  y.  The  desired  inequality  is  now  obtained  directly  from 
(5.6),  (5.8)-(5.10). 


I 


¥ 

I 


I 
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LEMMA  3  (Tightness).  Suppose  that  Al,  A2  hold,  dn  =  o(n)  and  X  is  either  a  continuous  process 
or  a  counting  process.  Then  {M^n\  n  >  1}  is  tight  in  D2. 

Proof.  Consider  the  following  increments  of  M^  over  neighbouring  rectangles  in  [0,  l]2.  Define 
Mi  as  before  and  ^ 

M2[t)  =  M(n>  ( («,  t] )  x  (y,  z] ), 

Ms(y)  =  M(n)  ( (t,  u] )  X  (x,  y]  ), 

where  0<s<t<u<l,  0<x<y<z<l.  Suppose  that  the  corner  points  of  the  rectangles 
belong  to  .  Also,  denote  n\  =  Mf  —  (Mi),  i  =  1,  2,  3.  From  the  representation  of  mi  in 
the  proof  of  Lemma  2  it  can  be  seen  that  mx  and  m2  are  orthogonal  martingales.  Thus,  using  the 
Cauchy-Schwarz  inequality  and  Lemma  2,  we  get 

E  M2(t)  M2(t)  =  E(Mi)t(M2)t  +  E  rni(t)(M2)t  +  E  m2(t)(M\)t  +  E  mi(t)  m2(t) 

<  (t-«)*(v -*)(*-»)•  (5.11) 

Next,  by  the  martingale  property  of  m3,  we  have 

E  Mi  (t)  M|(u)  =  E(Ml(t)  E(Ml(n)\7t)  )  =  E  (M?(t)(M3)u) 

=  E  mi (t)(M3)u  +  E(Ml)t(Mz)u, 

so  that,  again  using  the  Cauchy-Schwarz  inequality  and  Lemma  2,  we  obtain 

E  Mi  (t)  M|(u)  <  K  (y  —  x)2(t  -  s)^  (u  —  t).  (512) 

The  inequalities  (5.11)  and  (5.12)  imply  that  “condition  (P,  7)”  of  Bickel  and  Wichura  (1971, 
p.1658)  is  satisfied  with  P  =  3/2,  7  =  4,  for  rectangles  whose^corner  points  lie  in  T^n\  Clearly 
becomes  dense  in  [0,  l]2  as  n  grows  large.  Moreover,  M^  (t,  z )  is  constant  as  a  function 
of  z  over  each  interval  =  [(r  -  l)/d„,  r/dn),  r  =  l,...,d„,  so  the  modulus  of  continuity 
ojf  (M^n))  defined  in  Bickel  and  Wichura  can  be  computed  using  instead  of  [0,  l]2.  Tightness 
of  (M*n),  n  >  1}  now  follows  from  the  remarks  following  Theorem  3  of  Bickel  and  Wichura  (1971, 
p.1665). 


Convergence  of  finite  dimensional  distributions 


Recall  the  notation  H(t,z)  =  f*f*  h(s,x)dxds. 

LEMMA  4.  Suppose  that  A1-A3,  B1  hold  and  dn  =  o(n).  Then 

sup|(M(n)(-,z))t  -  H(t,z)  |^0. 


h 

& 

t'v. 


Proof  from  (2.2)  and  (5.2)  we  have  d(M^),  =  ]Cr=i  ^{^»  (s)  e  Jr}  Vi(s)  7(s,  Zi[s),  1 )  ds.  By 
continuity  of  7(7 -,1) 

I{Zi(s)  e  Jr}7(»,Sr(»),  1)  =  €  Ir}  (7(*,4n),l)  +  0(1))  (5.13) 

and  d(Mln]),  =  y/”*(e)  (7(s,  x[n\  1)  +  o(l) )  ds  for  arbitrary  G  Jr  uniformly  in  r  =  1  ,...,dn 
and  8  €  [0, 1].  Therefore  by  (5.3) 
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_  1  /**  n  t 

Thus,  since  h  is  continuous, 

£;sup|(M(n)(.,z))t-/r(t,2)| 

t,z 

-T„%e  /„U^(7(s'4",'1)+o(1»- M*.4">)|  *+»(!) 

<  A  +  li  +  <>(l), 


where 


It  follows  from  Lemma  A.3  that  7i  — ►  0.  From  (5.13)  we  have  that  7!"*(s)  =  Yr(n)(s)  (7(8,1!"*,  1)  + 
o(l)),  where  7! n)(s )  =  £"=1  /(.Z,(s)  €  Jr}  ^(s)  7(8,  ^(s),  1).  Therefore 

7(g,x!n),l)  _  7!n)(«)  ,  1  m 

n(n*(«)  (y,("’(*))2  yr(n)(s) 

Application  of  Lemma  A.6  yields  J2  — *  0.  This  proves  the  lemma. 

LEMMA  5.  Suppose  Al,  A2  hold  and  dn  —  o(n).  Then,  if  X  is  a  counting  process,  the  Lindeberg 
condition  (3.10)  is  satisfied. 

Proof.  By  (2.2)  and  (5.2)  it  suffices  to  show  that 

=  >«}*-*  0. 

Jo  n(n)(8)  Yjn)(s)  > 

But,  by  the  Cauchy-Schwarz  inequality,  Lemma  A.3  and  Chebychev’s  inequality 

which  proves  the  lemma. 

We  shall  make  use  of  the  following  notation: 

■»!*’(•>  =  =  E /{«(»)  e  i>i;(.)7(..*(.),i), 

•=l 

7^(8)  =  x>!n>(«). 


LEMMA  6.  Suppose  that  A1-A3,  B1  hold  and  dn  =  o(n).  Then 

”pg["cf£w  ~h  «<*'*>‘fa],  =  o0- 

Proof.  By  the  Cauchy-Schwarz  inequality  and  Lemma  A.3  (with  w„  =  1) 

El"(mwy  - 1  !(s’x)i*\ 


4^1/2 


where 


5  {Bl¥^]y12  {Elb,n){s)  -  (i^Y l <*>»<•.  *H‘} 

<K{it(.)  +  h(.)yi\ 

I1(a)  =  E[Wn\s)~  ^  p2(fl)ff(s,i)di]  , 

l2{s)  =  E[p{8)-rm}\ 

Now  7(n)(s)  is  a  sum  of  i.i.d.  r.v.’s,  each  of  which  is  uniformly  bounded  in  s  and  has  expectation 
fo  p2(s)g(s,x)  dx.  Thus  sup,  7i(s)  =  0(l/n2).  Similarly  sup,  12(a)  =  0(l/n2).  This  completes 
the  proof. 

LEMMA  7.  Suppose  that  A1-A3  hold  and  dn  =  o(n).  Then 

[*^*1  (n)  /  \  . 

sup  E  -  ^  .  y  ^r.  }8 - —  I  j(a,x,  l)dx  — >0. 

J  rtt  VrH(s)  P{*)Jo  n  *  I 

Proof.  Using  the  Cauchy-Schwarz  inequality  and  Lemma  A.3,  as  in  the  proof  of  Lemma  6,  we  see 
that  it  suffices  to  show  that  7i  — *  0  and  7-?  — »  0,  where 


7j  =  sup  E 


•t  ly^nl  (fl)  /  \  rz  _  *■ 

/  l(s,Tti)dx\ 
dn  rtt  Yr[n)(a)  Jo  n  1  J 


h  =  sup  E  [-r(n)(s)  -  p(s)]2. 

*  n 

Since  7(-,*,l)  is  continuous,  /0*  7(s,z,l)dz  =  d”1  7(5, zr,  1)  +  o(l)  uniformly  in  a  and  z, 

where  xT  €  7,  is  arbitrary.  Thus 


7i  <  sup  E 


*■>* 


iin|M 


-7(s,zr,l)  +0(1) 


which  tends  to  zero  by  Lemmas  A.4,  A.5  (with  a  replaced  by  7).  Finally,  72  — ►  0  by  the  proof  of 
Lemma  6. 


1 


k 


a 


a 


jp 

<r." 
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LEMMA  8.  Suppose  that  Al,  A2  hold  and  dn  =  o(n).  Then 

(a)  {M)t 4  f*  JjJ  g  (8,  i)  dx  ds ; 

(b)  f’  'y(s>x,l)dxds. 

Proof.  Using  the  orthogonality  of  the  martingales  M$n\  r  —  1, . . .  ,dn,  we  have 


!*<*»!  /*t  (n)/'  \ 

=  f  f~  [  ,  ?  M - d». 


y*n,(8)y(»)(s) 

Parts  (a)  and  (b)  the  follow  immediately  from  Lemmas  6  and  7,  respectively. 

Estimation  of  H,  h  and  g 

LEMMA  9.  Suppose  that  A1-A3,  B1  hold,  X  is  a  counting  process  and  dn  =  o(n).  Then 

sup  |  H(t,z)  -  H[t,z)  |“+0. 
t,* 

Proof  From  (3.4)-(3.6)  and  (5.3)  we  obtain  H  -  H  =  Ix  +  /2,  where 

_  l**»|  rt  | 

^nfr'Jo  {Yr(n){s)y  r  1  h 

I2{t,z)  =  (MW(.,z))t-H(t,z). 

From  Lemma  4  we  have  that  supt ,  |/3(t,z)  |-»0.  By  Doob’s  inequality,  (5.2),  (2.2),  (2.3)  and 
Lemma  A.3  we  get 

£sup/f(t,z)  <  4^t-  V' e[  — j— l - d(M^)t 

J  d*friJ0  (yr<»)(a))4  '  r  " 

<  4 -~g  sup a(s, z)  sup  E — — ^ - =  0(3)3)  o(— )  =  o(-M. 

<pn  .X  tX  (yr(n)(s)  )3  \n) 


LEMMA  10.  Suppose  that  X  is  a  counting  process,  the  assumptions  of  Theorem  3.1  hold,  dnb%  — * 
oo  and  K  is  Lipschitz.  Then 

(a)  Efofo  l*(M)  ~  h(t,z)\2dtdz  —  0; 

(b)  under  the  null  hypothesis  Ho  of  Section  4.1,  E  f^f^  \ h(t,z)  -  h(t,z)\2  dtdz  — ►  0  and 
Efofo  \g(t,z)~  9{t,z)\2dtdz-r0. 

Proof.  We  shall  prove  (a);  the  proof  of  (b)  is  similar.  From  (3.6)  and  the  definition  of  h,  we  can 
write 

h  -  h  =  (h  -  h)  +  (h  -  ftt)  +  (fct  _  h *)  _  Rj  (5.14) 


where 


'h^  =  kLLK^7)K{iT?)hMisix' 

h-{,,z)  =  -±-f'K(z- =^)  “ffll-rf., 

*>n<*n  "  '  6«  '  ^0  '  fcn  '  dn  (yr(n,(S))2 

£(*,*)  =  ITn  f1  KC~T1)- TT - <*Af<n)(s), 

v  J  '  in  )  Jo  V  i«  ;(Fr(n)(e))2 


n(A(t,  2)  -  A(t,  z))2  dt  dz  — *•  0. 


and  zr  =  r/d„.  Now  let  us  treat  each  term  in  (5.14)  separately.  First,  since  A  is  continuous, 

7‘ 

/o  ^0 

Secondly,  since  A  is  continuous  and  K  is  Lipschitz, 

sup|A(t,z)  -  ht(*>z)l  <  r-sup|  f  K(^-r-^)  h{ayx)dx  -  ^~Y^k(Z  -*-)  h(s,xr) 

t,x  bn  ,,,  'Jo  '  ®n  7  dn  "  \  »„  / 

s  ctpIE^Ktt)  •  *  (rir)  l‘ ^  }+<’(1) 

Thirdly,  using  Lemma  A.6  and  the  assumption  that  K  has  compact  support, 

supiSjAty,*)  -  h*(t,z)\2  < 


t,x 


m"?  t/o  fr(xi) *]’ “p [E'c(V1)]2s.uf£l'l(*’I') - 


*t<e 


0(6n)20(&ndn)20(l)  -  0. 


Finally,  using  Lemma  A.3, 

Sup£!B(t,z)i2  =  ^8uP{y;x2(i^)  /1^(ifi)Ef  f’1‘1 U} 
*.<  1  V  "  \  bn  J  Jo  \  bn  J  l(n(n,(s))4j  ' 


This  completes  the  proof. 
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